Microrna molecules

ABSTRACT

In  Caenorhabditis elegans , lin-4 and let-7 enclode 22- and 21-nucleotide RNAs, respectively, that function as key regulators of developmental timing. Because the appearance of these short RNAs is regulated during development, they are also referred to as “small temporal RNAs” (stRNAs). We show that many more 21- and 22-nt expressed RNAs, termed microRNAs, (miRNAs), exist in invertebrates and vertebrates, and that some of these novel RNAs, similar to let-7 stRAN, are also highly conserved. This suggests that sequence-specific post-transcriptional regulatory mechanisms mediated by small RNAs are more general than previously appreciated.

This Application is a divisional of U.S. Ser. No. 11/747,409 filed May11, 2007, which is a divisional of U.S. Pat. No. 7,232,806 issued Jun.19, 2007, which is a 371 of International Application PCT/EP2002/10881filed Sep. 27, 2002, the disclosure of which is incorporated herein inits entirety by reference.

The present invention relates to novel small expressed (micro)RNAmolecules associated with physiological regulatory mechanisms,particularly in developmental control.

In Caenorhabditis elegans, lin-4 and let-7 encode 22- and 21-nucleotideRNAs, respectively (1, 2), that function as key regulators ofdevelopmental timing (3-5). Because the appearance of these short RNAsis regulated during development, they are also referred to as“microRNAs” (miRNAs) or small temporal RNAs (stRNAs) (6). lin-4 andlet-21 are the only known miRNAs to date.

Two distinct pathways exist in animals and plants in which 21- to23-nucleotide RNAs function as post-transcriptional regulators of geneexpression. Small interfering RNAs (siRNAs) act as mediators ofsequence-specific mRNA degradation in RNA interference (RNAi) (7-11)whereas miRNAs regulate developmental timing by mediatingsequence-specific repression of mRNA translation (3-5). siRNAs andmiRNAs are excised from double-stranded RNA (dsRNA) precursors by Dicer(12, 13, 29), a multidomain RNase III protein, thus producing RNAspecies of similar size. However, siRNAs are believed to bedouble-stranded (8, 11, 12), while miRNAs are single-stranded (6).

We show that many more short, particularly 21- and 22-nt expressed RNAs,termed microRNAs (miRNAs), exist in invertebrates and vertebrates, andthat some of these novel RNAs, similar to let-7 RNA (6), are also highlyconserved. This suggests that sequence-specific post-transcriptionalregulatory mechanisms mediated by small RNAs are more general thanpreviously appreciated.

The present invention relates to an isolated nucleic acid moleculecomprising:

-   -   (a) a nucleotide sequence as shown in Table 1, Table 2, Table 3        or Table 4    -   (b) a nucleotide sequence which is the complement of (a),    -   (c) a nucleotide sequence which has an identity of at least 80%,        preferably of at least 90% and more preferably of at least 99%,        to a sequence of (a) or (b) and/or    -   (d) a nucleotide sequence which hybridizes under stringent        conditions to a sequence of (a), (b) and/or (c).

In a preferred embodiment the invention relates to miRNA molecules andanalogs thereof, to miRNA precursor molecules and to DNA moleculesencoding miRNA or miRNA precursor molecules.

Preferably the identity of sequence (c) to a sequence of (a) or (b) isat least 90%, more preferably at least 95%. The determination ofidentity (percent) may be carried out as follows:

I=n:L

wherein I is the identity in percent, n is the number of identicalnucleotides between a given sequence and a comparative sequence as shownin Table 1, Table 2, Table 3 or Table 4 and L is the length of thecomparative sequence. It should be noted that the nucleotides A, C, Gand U as depicted in Tables 1, 2, 3 and 4 may denote ribonucleotides,deoxyribonucleotides and/or other nucleotide analogs, e.g. syntheticnon-naturally occurring nucleotide analogs. Further nucleobases may besubstituted by corresponding nucleobases capable of forming analogousH-bonds to a complementary nucleic acid sequence, e.g. U may besubstituted by T.

Further, the invention encompasses nucleotide sequences which hybridizeunder stringent conditions with the nucleotide sequence as shown inTable 1, Table 2, Table 3 or Table 4, a complementary sequence thereofor a highly identical sequence. Stringent hybridization conditionscomprise washing for 1 h in 1×SSC and 0.1% SDS at 45° C., preferably at48° C. and more preferably at 50° C., particularly for 1 h in 0.2×SSCand 0.1% SDS.

The isolated nucleic acid molecules of the invention preferably have alength of from 18 to 100 nucleotides, and more preferably from 18 to 80nucleotides. It should be noted that mature miRNAs usually have a lengthof 19-24 nucleotides, particularly 21, 22 or 23 nucleotides. The miRNAs,however, may be also provided as a precursor which usually has a lengthof 50-90 nucleotides, particularly 60-80 nucleotides. It should be notedthat the precursor may be produced by processing of a primary transcriptwhich may have a length of >100 nucleotides.

The nucleic acid molecules may be present in single-stranded ordouble-stranded form. The miRNA as such is usually a single-strandedmolecule, while the mi-precursor is usually an at least partiallyself-complementary molecule capable of forming double-stranded portions,e.g. stem- and loop-structures. DNA molecules encoding the miRNA andmiRNA precursor molecules. The nucleic acids may be selected from RNA,DNA or nucleic acid analog molecules, such as sugar- orbackbone-modified ribonucleotides or deoxyribonucleotides. It should benoted, however, that other nucleic analogs, such as peptide nucleicacids (PNA) or locked nucleic acids (LNA), are also suitable.

In an embodiment of the invention the nucleic acid molecule is an RNA-or DNA molecule, which contains at least one modified nucleotide analog,i.e. a naturally occurring ribonucleotide or deoxyribonucleotide issubstituted by a non-naturally occurring nucleotide. The modifiednucleotide analog may be located for example at the 5′-end and/or the3′-end of the nucleic acid molecule.

Preferred nucleotide analogs are selected from sugar- orbackbone-modified ribonucleotides. It should be noted, however, thatalso nucleobase-modified ribonucleotides, i.e. ribonucleotides,containing a non-naturally occurring nucleobase instead of a naturallyoccurring nucleobase such as uridines or cytidines modified at the5-position, e.g. 5-(2-amino)propyl uridine, 5-bromo uridine; adenosinesand guanosines modified at the 8-position, e.g. 8-bromo guanosine; deazanucleotides, e.g. 7-deaza-adenosine; O- and N-alkylated nucleotides,e.g. N6-methyl adenosine are suitable. In preferred sugar-modifiedribonucleotides the 2′-OH-group is replaced by a group selected from H,OR, R, halo, SH, SR, NH₂, NHR, NR₂ or CN, wherein R is C₁-C₆ alkyl,alkenyl or alkynyl and halo is F, Cl, Br or I. In preferredbackbone-modified ribonucleotides the phosphoester group connecting toadjacent ribonucleotides is replaced by a modified group, e.g. ofphosphothioate group. It should be noted that the above modificationsmay be combined.

The nucleic acid molecules of the invention may be obtained by chemicalsynthesis methods or by recombinant methods, e.g. by enzymatictranscription from synthetic DNA-templates or from DNA-plasmids isolatedfrom recombinant organisms. Typically phage RNA-polymerases are used fortranscription, such as T7, T3 or SP6 RNA-polymerases.

The invention also relates to a recombinant expression vector comprisinga recombinant nucleic acid operatively linked to an expression controlsequence, wherein expression, i.e. transcription and optionally furtherprocessing results in a miRNA-molecule or miRNA precursor molecule asdescribed above. The vector is preferably a DNA-vector, e.g. a viralvector or a plasmid, particularly an expression vector suitable fornucleic acid expression in eukaryotic, more particularly mammaliancells. The recombinant nucleic acid contained in said vector may be asequence which results in the transcription of the miRNA-molecule assuch, a precursor or a primary transcript thereof, which may be furtherprocessed to give the miRNA-molecule.

Further, the invention relates to diagnostic or therapeutic applicationsof the claimed nucleic acid molecules. For example, miRNAs may bedetected in biological samples, e.g. in tissue sections, in order todetermine and classify certain cell types or tissue types ormiRNA-associated pathogenic disorders which are characterized bydifferential expression of miRNA-molecules or miRNA-molecule patterns.Further, the developmental stage of cells may be classified bydetermining temporarily expressed miRNA-molecules.

Further, the claimed nucleic acid molecules are suitable for therapeuticapplications. For example, the nucleic acid molecules may be used asmodulators or targets of developmental processes or disorders associatedwith developmental dysfunctions, such as cancer. For example, miR-15 andmiR-16 probably function as tumor-suppressors and thus expression ordelivery of these RNAs or analogs or precursors thereof to tumor cellsmay provide therapeutic efficacy, particularly against leukemias, suchas B-cell chronic lymphocytic leukemia (B-CLL). Further, miR-10 is apossible regulator of the translation of Hox Genes, particularly Hox 3and Hox 4 (or Scr and Dfd in Drosophila).

In general, the claimed nucleic acid molecules may be used as amodulator of the expression of genes which are at least partiallycomplementary to said nucleic acid. Further, miRNA molecules may act astarget for therapeutic screening procedures, e.g. inhibition oractivation of miRNA molecules might modulate a cellular differentiationprocess, e.g. apoptosis.

Furthermore, existing miRNA molecules may be used as starting materialsfor the manufacture of sequence-modified miRNA molecules, in order tomodify the target-specificity thereof, e.g. an oncogene, amultidrug-resistance gene or another therapeutic target gene. The novelengineered miRNA molecules preferably have an identity of at least 80%to the starting miRNA, e.g. as depicted in Tables 1, 2, 3 and 4.Further, miRNA molecules can be modified, in order that they aresymetrically processed and then generated as double-stranded siRNAswhich are again directed against therapeutically relevant targets.

Furthermore, miRNA molecules may be used for tissue reprogrammingprocedures, e.g. a differentiated cell line might be transformed byexpression of miRNA molecules into a different cell type or a stem cell.

For diagnostic or therapeutic applications, the claimed RNA moleculesare preferably provided as a pharmaceutical composition. Thispharmaceutical composition comprises as an active agent at least onenucleic acid molecule as described above and optionally apharmaceutically acceptable carrier.

The administration of the pharmaceutical composition may be carried outby known methods, wherein a nucleic acid is introduced into a desiredtarget cell in vitro or in vivo.

Commonly used gene transfer techniques include calcium phosphate,DEAE-dextran, electroporation and microinjection and viral methods [30,31, 32, 33, 34]. A recent addition to this arsenal of techniques for theintroduction of DNA into cells is the use of cationic liposomes [35].

Commercially available cationic lipid formulations are e.g. Tfx 50(Promega) or Lipofectamin 2000 (Life Technologies).

The composition may be in form of a solution, e.g. an injectablesolution, a cream, ointment, tablet, suspension or the like. Thecomposition may be administered in any suitable way, e.g. by injection,by oral, topical, nasal, rectal application etc. The carrier may be anysuitable pharmaceutical carrier. Preferably, a carrier is used, which iscapable of increasing the efficacy of the RNA molecules to enter thetarget-cells. Suitable examples of such carriers are liposomes,particularly cationic liposomes.

Further, the invention relates to a method of identifying novelmicroRNA-molecules and precursors thereof, in eukaryotes, particularlyin vertebrates and more particularly in mammals, such as humans or mice.This method comprises: ligating 5′- and 3′-adapter-molecules to the endof a size-fractionated RNA-population, reverse transcribing saidadapter-ligated RNA-population, and characterizing said reversetranscribed RNA-molecules, e.g. by amplification, concatamerization,cloning and sequencing.

A method as described above already has been described in (8), however,for the identification of siRNA molecules. Surprisingly, it was foundnow that the method is also suitable for identifying the miRNA moleculesor precursors thereof as claimed in the present application.

Further, it should be noted that as 3′-adaptor for derivatization of the3′-OH group not only 4-hydroxymethylbenzyl but other types ofderivatization groups, such as alkyl, alkyl amino, ethylene glycol or3′-deoxy groups are suitable.

Further, the invention shall be explained in more detail by thefollowing Figures and Examples:

FIGURE LEGENDS

FIG. 1A. Expression of D. melanogaster miRNAs. Northern blots of totalRNA isolated from staged populations of D. melanogaster were probed forthe indicated miRNAs. The position of 76-nt val-tRNA is also indicatedon the blots. 5S rRNA serves as loading control. E, embryo; L, larvalstage; P, pupae; A, adult; S2, Schneider-2 cells. It should be pointedout, that S2 cells are polyclonal, derived from an unknown subset ofembryonic tissues, and may have also lost some features of their tissueof origin while maintained in culture. miR-3 to miR-6 RNAs were notdetectable in S2 cells (data not shown). miR-14 was not detected byNorthern blotting and may be very weakly expressed, which is consistentwith its cloning frequency. Similar miRNA sequences are difficult todistinguish by Northern blotting because of potentialcross-hybridization of probes.

FIG. 1B. Expression of vertebrate miRNAs. Northern blots of total RNAisolated from HeLa cells, mouse kidneys, adult zebrafish, frog ovaries,and S2 cells were probed for the indicated miRNAs. The position of 76-ntval-tRNA is also indicated on the blots. 5S rRNA from the preparationsof total RNA from the indicated species is also shown. The gels used forprobing of miR-18, miR-19a, miR-30, and miR-31 were not run as far asthe other gels (see tRNA marker position). miR-32 and miR-33 were notdetected by Northern blotting, which is consistent with their lowcloning frequency. Oligodeoxynucleotides used as Northern probes were:

(SEQ ID NO: 1) let-7a, 5′ TACTATACAACCTACTACCTCAATTTGCC; (SEQ ID NO: 2)let-7d, 5′ ACTATGCAACCTACTACCTCT; (SEQ ID NO: 3) let-7e,5′ ACTATACAACCTCCTACCTCA; (SEQ ID NO: 4) D. melanogaster val-tRNA,5′ TGGTGTTTCCGCCCGGGAA; (SEQ ID NO: 5) miR-1, 5′ TGGAATGTAAAGAAGTATGGAG;(SEQ ID NO: 6) miR-2b, 5′ GCTCCTCAAAGCTGGCTGTGATA; (SEQ ID NO: 7) miR-3,5′ TGAGACACACTTTGCCCAGTGA; (SEQ ID NO: 8) miR-4,5′ TCAATGGTTGTCTAGCTTTAT; (SEQ ID NO: 9) miR-5,5′ CATATCACAACGATCGTTCCTTT; (SEQ ID NO: 10) miR-6,5′ AAAAAGAACAGCCACTGTGATA; (SEQ ID NO: 11) miR-7,5′ TGGAAGACTAGTGATTTTGTTGT; (SEQ ID NO: 12) miR-8,5′ GACATCTTTACCTGACAGTATTA; (SEQ ID NO: 13) miR-9,5′ TCATACAGCTAGATAACCAAAGA; (SEQ ID NO: 14) miR-10,5′ ACAAATTCGGATCTACAGGGT; (SEQ ID NO: 15) miR-11,5′ GCAAGAACTCAGACTGTGATG; (SEQ ID NO: 16) miR-12,5′ ACCAGTACCTGATGTAATACTCA; (SEQ ID NO: 17) miR-13a,5′ ACTCGTCAAAATGGCTGTGATA; (SEQ ID NO: 18) miR-14,5′ TAGGAGAGAGAAAAAGACTGA; (SEQ ID NO: 19) miR-15,5′ TAGCAGCACATAATGGTTTGT; (SEQ ID NO: 20) miR-16,5′ GCCAATATTTACGTGCTGCTA; (SEQ ID NO: 21) miR-17,5′ TACAAGTGCCTTCACTGCAGTA; (SEQ ID NO: 22) miR-18,5′ TATCTGCACTAGATGCACCTTA; (SEQ ID NO: 23) miR-19a,5′ TCAGTTTTGCATAGATTTGCACA; (SEQ ID NO: 24) miR-20,5′ TACCTGCACTATAAGCACTTTA; (SEQ ID NO: 25) miR-21,5′ TCAACATCAGTCTGATAAGCTA; (SEQ ID NO: 26) miR-22,5′ ACAGTTCTTCAACTGGCAGCTT; (SEQ ID NO: 27) miR-23,5′ GGAAATCCCTGGCAATGTGAT; (SEQ ID NO: 28) miR-24,5′ CTGTTCCTGCTGAACTGAGCCA; (SEQ ID NO: 29) miR-25,5′ TCAGACCGAGACAAGTGCAATG; (SEQ ID NO: 30) miR-26a,5′ AGCCTATCCTGGATTACTTGAA; (SEQ ID NO: 31) miR-27;5′ AGCGGAACTTAGCCACTGTGAA; (SEQ ID NO: 32) miR-28,5′ CTCAATAGACTGTGAGCTCCTT; (SEQ ID NO: 33) miR-29,5′ AACCGATTTCAGATGGTGCTAG; (SEQ ID NO: 34) miR-30,5′ GCTGCAAACATCCGACTGAAAG; (SEQ ID NO:35) miR-31,5′ CAGCTATGCCAGCATCTTGCCT; (SEQ ID NO: 36) miR-32,5′ GCAACTTAGTAATGTGCAATA; (SEQ ID NO: 37) miR-33,5′ TGCAATGCAACTACAATGCACC.

FIG. 2. Genomic organization of miRNA gene clusters. The precursorstructure is indicated as box and the location of the miRNA within theprecursor is shown in gray; the chromosomal location is also indicatedto the right. (A) D. melanogaster miRNA gene clusters. (B) Human miRNAgene clusters. The cluster of let-7a-1 and let-7f-1 is separated by26500 nt from a copy of let-7d on chromosome 9 and 17. A cluster oflet-7a-3 and let-7b, separated by 938 nt on chromosome 22, is notillustrated.

FIG. 3. Predicted precursor structures of D. melanogaster miRNAs. RNASecondary structure prediction was performed using mfold version 3.1[28] and manually refined to accommodate G/U wobble base pairs in thehelical segments. The miRNA sequence is underlined. The actual size ofthe stem-loop structure is not known experimentally and may be slightlyshorter or longer than represented. Multicopy miRNAs and theircorresponding precursor structures are also shown.

FIG. 4. Predicted precursor structures of human miRNAs. For legend, seeFIG. 3.

FIG. 5. Expression of novel mouse miRNAs. Northern blot analysis ofnovel mouse miRNAs. Total RNA from different mouse tissues was blottedand probed with a 5′-radiolabeled oligodeoxynucleotide complementary tothe indicated miRNA. Equal loading of total RNA on the gel was verifiedby ethidium bromide staining prior to transfer; the band representingtRNAs is shown. The fold-back precursors are indicated with capital L.Mouse brains were dissected into midbrain, mb, cortex, cx, cerebellum,cb. The rest of the brain, rb, was also used. Other tissues were heart,ht, lung, Ig, liver, lv, colon, co, small intestine, si, pancreas, pc,spleen, sp, kidney, kd, skeletal muscle, sm, stomach, st, H, human HelaSS3 cells. Oligodeoxynucleotides used as Northern probes were:

(SEQ ID NO: 38) miR-1a, CTCCATACTTCTTTACATTCCA; (SEQ ID NO: 39) miR-30b,GCTGAGTGTAGGATGTTTACA; (SEQ ID NO: 40) miR-30a-s,GCTTCCAGTCGAGGATGTTTACA; (SEQ ID NO: 41) miR-99b,CGCAAGGTCGGTTCTACGGGTG; (SEQ ID NO: 42) miR-101, TCAGTTATCACAGTACTGTA;(SEQ ID NO: 43) miR-122a, ACAAACACCATTGTCACACTCCA; (SEQ ID NO: 44)miR-124a, TGGCATTCACCGCGTGCCTTA; (SEQ ID NO: 45) miR-125a,CACAGGTTAAAGGGTCTCAGGGA; (SEQ ID NO: 46) miR-125b,TCACAAGTTAGGGTCTCAGGGA; (SEQ ID NO: 47) miR-127, AGCCAAGCTCAGACGGATCCGA;(SEQ ID NO: 48) miR-128, AAAAGAGACCGGTTCACTCTGA; (SEQ ID NO: 49)miR-129, GCAAGCCCAGACCGAAAAAAG; (SEQ ID NO: 50) miR-130,GCCCTTTTAACATTGCACTC; (SEQ ID NO: 51) miR-131, ACTTTCGGTTATCTAGCTTTA;(SEQ ID NO: 52) miR-132, ACGACCATGGCTGTAGACTGTTA; (SEQ ID NO: 53)miR-143, TGAGCTACAGTGCTTCATCTCA.

FIG. 6. Potential orthologs of lin-4 stRNA. (A) Sequence alignment of C.elegans lin-4 stRNA with mouse miR-125a and miR-125b and the D.melanogaster miR-125. Differences are highlighted by gray boxes. (B)Northern blot of total RNA isolated from staged populations of D.melanogaster, probed for miR-125. E, embryo; L, larval stage; P, pupae;A, adult; S2, Schneider-2 cells.

FIG. 7. Predicted precursor structures of miRNAs, sequence accessionnumbers and homology information. RNA secondary structure prediction wasperformed using mfold version 3.1 and manually refined to accommodateG/U wobble base pairs in the helical segments. Dashes were inserted intothe secondary structure presentation when asymmetrically bulgednucleotides had to be accommodated. The excised miRNA sequence isunderlined. The actual size of the stem-loop structure is not knownexperimentally and may be slightly shorter or longer than represented.Multicopy miRNAs and their corresponding precursor structures are alsoshown. In cases where no mouse precursors were yet deposited in thedatabase, the human orthologs are indicated. miRNAs which correspond toD. melanogaster or human sequences are included. Published C. elegansmiRNAs [36, 37] are also included in the table. A recent set of new HeLacell miRNAs is also indicated [46]. If several ESTs were retrieved forone organism in the database, only those with different precursorsequences are listed. miRNA homologs found in other species areindicated. Chromosomal location and sequence accession numbers, andclusters of miRNA genes are indicated. Sequences from cloned miRNAs weresearched against mouse and human in GenBank (including trace data), andagainst Fugu rubripes and Dania rerio at www.jgi.doe.gov andwww.sanger.ac.uk, respectively.

EXAMPLE 1 MicroRNAs from D. melanogaster and Human

We previously developed a directional cloning procedure to isolatesiRNAs after processing of long dsRNAs in Drosophila melanogaster embryolysate (8). Briefly, 5′ and 3′ adapter molecules were ligated to theends of a size-fractionated RNA population, followed by reversetranscription, PCR amplification, concatamerization, cloning andsequencing. This method, originally intended to isolate siRNAs, led tothe simultaneous identification of 14 novel 20- to 23-nt short RNAswhich are encoded in the D. melanogaster genome and which are expressedin 0 to 2 h embryos (Table 1). The method was adapted to clone RNAs in asimilar size range from HeLa cell total RNA (14), which led to theidentification of 19 novel human stRNAs (Table 2), thus providingfurther evidence for the existence of a large class of small RNAs withpotential regulatory roles. According to their small size, we refer tothese novel RNAs as microRNAs or miRNAs. The miRNAs are abbreviated asmiR-1 to miR-33, and the genes encoding miRNAs are named mir-1 tomir-33. Highly homologous miRNAs are classified by adding a lowercaseletter, followed by a dash and a number for designating multiple genomiccopies of a mir gene.

The expression and size of the cloned, endogenous short RNAs was alsoexamined by Northern blotting (FIG. 1, Table 1 and 2). Total RNAisolation was performed by acid guanidiniumthiocyanate-phenol-chloroform extraction [45]. Northern analysis wasperformed as described [1], except that the total RNA was resolved on a15% denaturing polyacrylamide gel, transferred onto Hybond-N+membrane(Amersham Pharmacia Biotech), and the hybridization and wash steps wereperformed at 50° C. Oligodeoxynucleotides used as Northern probes were5′-32P-phosphorylated, complementary to the miRNA sequence and 20 to 25nt in length.

5S rRNA was detected by ethidium staining of polyacrylamide gels priorto transfer. Blots were stripped by boiling in 0.1% aqueous sodiumdodecylsulfate/0.1×SSC (15 mM sodium chloride, 1.5 mM sodium citrate, pH7.0) for 10 min, and were re-probed up to 4 times until the 21-ntsignals became too weak for detection. Finally, blots were probed forval-tRNA as size marker.

For analysis of D. melanogaster RNAs, total RNA was prepared fromdifferent developmental stages, as well as cultured Schneider-2 (S2)cells, which originally derive from 20-24 h D. melanogaster embryos [15](FIG. 1, Table 1). miR-3 to miR-7 are expressed only duringembryogenesis and not at later developmental stages. The temporalexpression of miR-1, miR-2 and miR-8 to miR-13 was less restricted.These miRNAs were observed at all developmental stages thoughsignificant variations in the expression levels were sometimes observed.Interestingly, miR-1, miR-3 to miR-6, and miR-8 to miR-11 werecompletely absent from cultured Schneider-2 (S2) cells, which wereoriginally derived from 20-24 h D. melanogaster embryos [15], whilemiR-2, miR-7, miR-12, and miR-13 were present in S2 cells, thereforeindicating cell type-specific miRNA expression. miR-1, miR-8, and miR-12expression patterns are similar to those of lin-4 stRNA in C. elegans,as their expression is strongly upregulated in larvae and sustained toadulthood [16]. miR-9 and miR-11 are present at all stages but arestrongly reduced in the adult which may reflect a maternal contributionfrom germ cells or expression in one sex only.

The mir-3 to mir-6 genes are clustered (FIG. 2A), and mir-6 is presentas triple repeat with slight variations in the mir-6 precursor sequencebut not in the miRNA sequence itself. The expression profiles of miR-3to miR-6 are highly similar (Table 1), which suggests that a singleembryo-specific precursor transcript may give rise to the differentmiRNAs, or that the same enhancer regulates miRNA-specific promoters.Several other fly miRNAs are also found in gene clusters (FIG. 2A).

The expression of HeLa cell miR-15 to miR-33 was examined by Northernblotting using HeLa cell total RNA, in addition to total RNA preparedfrom mouse kidneys, adult zebrafish, Xenopus laevis ovary, and D.melanogaster S2 cells (FIG. 1B, Table 2). miR-15 and miR-16 are encodedin a gene cluster (FIG. 2B) and are detected in mouse kidney, fish, andvery weakly in frog ovary, which may result from miRNA expression insomatic ovary tissue rather than oocytes. mir-17 to mir-20 are alsoclustered (FIG. 2B), and are expressed in HeLa cells and fish, butundetectable in mouse kidney and frog ovary (FIG. 1, Table 2), andtherefore represent a likely case of tissue-specific miRNA expression.

The majority of vertebrate and invertebrate miRNAs identified in thisstudy are not related by sequence, but a few exceptions, similar to thehighly conserved let-7 RNA [6], do exist. Sequence analysis of the D.melanogaster miRNAs revealed four such examples of sequence conservationbetween invertebrates and vertebrates. miR-1 homologs are encoded in thegenomes of C. elegans, C. briggsae, and humans, and are found in cDNAsfrom zebrafish, mouse, cow and human. The expression of mir-1 wasdetected by Northern blotting in total RNA from adult zebrafish and C.elegans, but not in total RNA from HeLa cells or mouse kidney (Table 2and data not shown). Interestingly, while mir-1 and let-7 are expressedboth in adult flies (FIG. 1A) [6] and are both undetected in S2 cells,miR-1 is, in contrast to let-7, undetectable in HeLa cells. Thisrepresents another case of tissue-specific expression of a miRNA, andindicates that miRNAs may not only play a regulatory role indevelopmental timing, but also in tissue specification. miR-7 homologswere found by database searches in mouse and human genomic and expressedsequence tag sequences (ESTs). Two mammalian miR-7 variants arepredicted by sequence analysis in mouse and human, and were detected byNorthern blotting in HeLa cells and fish, but not in mouse kidney (Table2). Similarly, we identified mouse and human miR-9 and miR-10 homologsby database searches but only detected mir-10 expression in mousekidney.

The identification of evolutionary related miRNAs, which have alreadyacquired multiple sequence mutations, was not possible by standardbioinformatic searches. Direct comparison of the D. melanogaster miRNAswith the human miRNAs identified an 11-nt segment shared between D.melanogaster miR-6 and HeLa miR-27, but no further relationships weredetected. One may speculate that most miRNAs only act on a single targetand therefore allow for rapid evolution by covariation, and that highlyconserved miRNAs act on more than one target sequence, and thereforehave a reduced probability for evolutionary drift by covariation [6]. Analternative interpretation is that the sets of miRNAs from D.melanogaster and humans are fairly incomplete and that many more miRNAsremain to be discovered, which will provide the missing evolutionarylinks.

lin-4 and let-7 stRNAs were predicted to be excised from longertranscripts that contain approximately 30 base-pair stem-loop structures[1, 6]. Database searches for newly identified miRNAs revealed that allmiRNAs are flanked by sequences that have the potential to form stablestem-loop structures (FIGS. 3 and 4). In many cases, we were able todetect the predicted, approximately 70-nt precursors by Northernblotting (FIG. 1).

Some miRNA precursor sequences were also identified in mammalian cDNA(EST) databases [27], indicating that primary transcripts longer than70-nt stem-loop precursors do also exist. We never cloned a 22-nt RNAcomplementary to any of the newly identified miRNAs, and it is as yetunknown how the cellular processing machinery distinguishes between themiRNA and its complementary strand. Comparative analysis of theprecursor stem-loop structures indicates that the loops adjacent to thebase-paired miRNA segment can be located on either side of the miRNAsequence (FIGS. 3 and 4), suggesting that the 5′ or 3′ location of thestem-closing loop is not the determinant of miRNA excision. It is alsounlikely that the structure, length or stability of the precursor stemis the critical determinant as the base-paired structures are frequentlyimperfect and interspersed by less stable, non-Watson-Crick base pairssuch as G/A, U/U, C/U, A/A, and G/U wobbles. Therefore, asequence-specific recognition process is a likely determinant for miRNAexcision, perhaps mediated by members of the Argonaute (rde-1/ago1/piwi)protein family. Two members of this family, alg-1 and alg-2, haverecently been shown to be critical for stRNA processing in C. elegans[13]. Members of the Argonaute protein family are also involved in RNAiand PTGS. In D. melanogaster, these include argonaute2, a component ofthe siRNA-endonuclease complex (RISC) [17], and its relative aubergine,which is important for silencing of repeat genes [18]. In other species,these include rde-1, argonaute1, and qde-2, in C. elegans [19],Arabidopsis thaliana [20], and Neurospora crassa [21], respectively. TheArgonaute protein family therefore represents, besides the RNase IIIDicer [12, 13], another evolutionary link between RNAi and miRNAmaturation.

Despite advanced genome projects, computer-assisted detection of genesencoding functional RNAs remains problematic [22]. Cloning of expressed,short functional RNAs, similar to EST approaches (RNomics), is apowerful alternative and probably the most efficient method foridentification of such novel gene products [23-26]. The number offunctional RNAs has been widely underestimated and is expected to growrapidly because of the development of new functional RNA cloningmethodologies.

The challenge for the future is to define the function and the potentialtargets of these novel miRNAs by using bioinformatics as well asgenetics, and to establish a complete catalogue of time- andtissue-specific distribution of the already identified and yet to beuncovered miRNAs. lin-4 and let-7 stRNAs negatively regulate theexpression of proteins encoded by mRNAs whose 3′ untranslated regionscontain sites of complementarity to the stRNA [3-5].

Thus, a series of 33 novel genes, coding for 19- to 23-nucleotidemicroRNAs (miRNAs), has been cloned from fly embryos and human cells.Some of these miRNAs are highly conserved between vertebrates andinvertebrates and are developmentally or tissue-specifically expressed.Two of the characterized human miRNAs may function as tumor suppressorsin B-cell chronic lymphocytic leukemia. miRNAs are related to a smallclass of previously described 21- and 22-nt RNAs (lin-4 and let-7 RNAs),so-called small temporal RNAs (stRNAs), and regulate developmentaltiming in C. elegans and other species. Similar to stRNAs, miRNAs arepresumed to regulate translation of specific target mRNAs by binding topartially complementary sites, which are present in their3′-untranslated regions.

Deregulation of miRNA expression may be a cause of human disease, anddetection of expression of miRNAs may become useful as a diagnostic.Regulated expression of miRNAs in cells or tissue devoid of particularmiRNAs may be useful for tissue engineering, and delivery or transgenicexpression of miRNAs may be useful for therapeutic intervention. miRNAsmay also represent valuable drug targets itself. Finally, miRNAs andtheir precursor sequences may be engineered to recognize therapeuticvaluable targets.

EXAMPLE 2 miRNAs from Mouse

To gain more detailed insights into the distribution and function ofmiRNAs in mammals, we investigated the tissue-specific distribution ofmiRNAs in adult mouse. Cloning of miRNAs from specific tissues waspreferred over whole organism-based cloning because low-abundance miRNAsthat normally go undetected by Northern blot analysis are identifiedclonally. Also, in situ hybridization techniques for detecting 21-ntRNAs have not yet been developed. Therefore, 19- to 25-nucleotide RNAswere cloned and sequenced from total RNA, which was isolated from 18.5weeks old BL6 mice. Cloning of miRNAs was performed as follows: 0.2 to 1mg, of total RNA was separated on a 15% denaturing polyacrylamide geland RNA of 19- to 25-nt size was recovered. A 5′-phosphorylated3′-adapter oligonucleotide (5′-pUUUaaccgcgaattccagx: uppercase, RNA;lowercase, DNA; p, phosphate; x, 3′-Amino-Modifier C-7, ChemGenes,Ashland, Ma, USA, Cat. No. NSS-1004; SEQ ID NO:54) and a 5′-adapteroligonucleotide (5′-acggaattcctcactAAA: uppercase, RNA; lowercase, DNA;SEQ ID NO:55) were ligated to the short RNAs. RT/PCR was performed with3′-primer (5′-GACTAGCTGGAATTCGCGGTTAAA; SEQ ID NO:56) and 5′-primer(5′-CAGCCAACGGAATTCCTCACTAAA; SEQ ID NO:57). In order to introduce Ban Irestriction sites, a second PCR was performed using the primer pair5′-CAGCCAACAGGCACCGAATTCCTCACTAAA (SEQ ID NO:57) and5′-GACTAGCTTGGTGCCGAATTCGCGGTTAAA (SEQ ID NO:56), followed byconcatamerization after Ban I digestion and T4 DNA ligation. Concatamersof 400 to 600 basepairs were cut out from 1.5% agarose gels andrecovered by Biotrap (Schleicher & Schuell) electroelution (1×TAEbuffer) and by ethanol precipitation. Subsequently, the 3′ ends of theconcatamers were filled in by incubating for 15 min at 72° C. with Taqpolymerase in standard PCR reaction mixture. This solution was diluted3-fold with water and directly used for ligation into pCR2.1 TOPOvectors. Clones were screened for inserts by PCR and 30 to 50 sampleswere subjected to sequencing. Because RNA was prepared from combiningtissues of several mice, minor sequence variations that were detectedmultiple times in multiple clones may reflect polymorphisms rather thanRT/PCR mutations. Public database searching was used to identify thegenomic sequences encoding the approx. 21-nt RNAs. The occurrence of a20 to 30 basepair fold-back structure involving the immediate upstreamor downstream flanking sequences was used to assign miRNAs [36-38].

We examined 9 different mouse tissues and identified 34 novel miRNAs,some of which are highly tissue-specifically expressed (Table 3 and FIG.5). Furthermore, we identified 33 new miRNAs from different mousetissues and also from human Soas-2 osteosarcoma cells (Table 4). miR-1was previously shown by Northern analysis to be strongly expressed inadult heart, but not in brain, liver, kidney, lung or colon [37]. Herewe show that miR-1 accounts for 45% of all mouse miRNAs found in heart,yet miR-1 was still expressed at a low level in liver and midbrain eventhough it remained undetectable by Northern analysis. Three copies orpolymorphic alleles of miR-1 were found in mice. The conservation oftissue-specific miR-1 expression between mouse and human providesadditional evidence for a conserved regulatory role of this miRNA. Inliver, variants of miR-122 account for 72% of all cloned miRNAs andmiR-122 was undetected in all other tissues analyzed. In spleen, miR-143appeared to be most abundant, at a frequency of approx. 30%. In colon,miR-142-as, was cloned several times and also appeared at a frequency of30%. In small intestine, too few miRNA sequences were obtained to permitstatistical analysis. This was due to strong RNase activity in thistissue, which caused significant breakdown of abundant non-coding RNAs,e.g. rRNA, so that the fraction of miRNA in the cloned sequences wasvery low. For the same reason, no miRNA sequences were obtained frompancreas.

To gain insights in neural tissue miRNA distribution, we analyzedcortex, cerebellum and midbrain. Similar to heart, liver and smallintestine, variants of a particular miRNA, miR-124, dominated andaccounted for 25 to 48% of all brain miRNAs. miR-101, -127, -128, -131,and -132, also cloned from brain tissues, were further analyzed byNorthern blotting and shown to be predominantly brain-specific. Northernblot analysis was performed as described in Example 1. tRNAs and 5S rRNAwere detected by ethidium staining of polyacrylamide gels prior totransfer to verify equal loading. Blots were stripped by boiling indeionized water for 5 min, and reprobed up to 4 times until the 21-ntsignals became too weak for detection.

miR-125a and miR-125b are very similar to the sequence of C. eleganslin-4 stRNA and may represent its orthologs (FIG. 6A). This is of greatinterest because, unlike let-7 that was readily detected in otherspecies, lin-4 has acquired a few mutations in the central region andthus escaped bioinformatic database searches. Using the mouse sequencemiR-125b, we could readily identify its ortholog in the D. melanogastergenome. miR-125a and miR-125b differ only by a central diuridineinsertion and a U to C change. miR-125b is very similar to lin-4 stRNAwith the differences located only in the central region, which ispresumed to be bulged out during target mRNA recognition [41]. miR-125aand miR-125b were cloned from brain tissue, but expression was alsodetected by Northern analysis in other tissues, consistent with the rolefor lin-4 in regulating neuronal remodeling by controlling lin-14expression [43]. Unfortunately, orthologs to C. elegans lin-14 have notbeen described and miR-125 targets remain to be identified in D.melanogaster or mammals. Finally, miR-125b expression is alsodevelopmentally regulated and only detectable in pupae and adult but notin embryo or larvae of D, melanogaster (FIG. 6B).

Sequence comparison of mouse miRNAs with previously described miRNAreveals that miR-99b and miR-99a are similar to D. melanogaster, mouseand human miR-10 as well as C. elegans miR-51 [36], miR-141 is similarto D. melanogaster miR-8, miR-29b is similar to C. elegans miR-83, andmiR-131 and miR-142-s are similar to D. melanogaster miR-4 and C.elegans miR-79 [36]. miR-124a is conserved between invertebrates andvertebrates. In this respect it should be noted that for almost everymiRNA cloned from mouse was also encoded in the human genome, andfrequently detected in other vertebrates, such as the pufferfish, Fugurubripes, and the zebrafish, Danio rerio. Sequence conservation maypoint to conservation in function of these miRNAs. Comprehensiveinformation about orthologous sequences is listed in FIG. 7.

In two cases both strands of miRNA precursors were cloned (Table 3),which was previously observed once for a C. elegans miRNA (361. It isthought that the most frequently cloned strand of a miRNA precursorrepresents the functional miRNA, which is miR-30c-s and miR-142-as, sand as indicating the 5′ or 3′ side of the fold-back structure,respectively.

The mir-142 gene is located on chromosome 17, but was also found at thebreakpoint junction of a t(8;17) translocation, which causes anaggressive B-cell leukemia due to strong up-regulation of a translocatedMYC gene [44]. The translocated MYC gene, which was also truncated atthe first exon, was located only 4-nt downstream of the 3′-end of themiR-142 precursor. This suggests that translocated MYC was under thecontrol of the upstream miR-142 promoter. Alignment of mouse and humanmiR-142 containing EST sequences indicate an approximately 20 ntconserved sequence element downstream of the mir-142 hairpin. Thiselement was lost in the translocation. It is conceivable that theabsence of the conserved downstream sequence element in the putativemiR-142/mRNA fusion prevented the recognition of the transcript as amiRNA precursor and therefore may have caused accumulation of fusiontranscripts and overexpression of MYC.

miR-155, which was cloned from colon, is excised from the knownnoncoding BIC RNA [47]. BIC was originally identified as a genetranscriptionally activated by promoter insertion at a common retroviralintegration site in B cell lymphomas induced by avian leukosis virus.Comparison of BIC cDNAs from human, mouse and chicken revealed 78%identity over 138 nucleotides [47]. The identity region covers themiR-155 fold-back precursor and a few conserved boxes downstream of thefold-back sequence. The relatively high level of expression of BIC inlymphoid organs and cells in human, mouse and chicken implies anevolutionary conserved function, but BIC RNA has also been detected atlow levels in non-hematopoietic tissues [47].

Another interesting observation was that segments of perfectcomplementarity to miRNAs are not observed in mRNA sequences or ingenomic sequences outside the miRNA inverted repeat. Although this couldbe fortuitous, based on the link between RNAi and miRNA processing [11,13, 43] it may be speculated that miRNAs retain the potential to cleaveperfectly complementary target RNAs. Because translational controlwithout target degradation could provide more flexibility it may bepreferred over mRNA degradation.

In summary, 63 novel miRNAs were identified from mouse and 4 novelmiRNAs were identified from human Soas-2 osteosarcoma cells (Table 3 andTable 4), which are conserved in human and often also in othernon-mammalian vertebrates. A few of these miRNAs appear to be extremelytissue-specific, suggesting a critical role for some miRNAs intissue-specification and cell lineage decisions. We may have alsoidentified the fruitfly and mammalian ortholog of C. elegans lin-4stRNA. The establishment of a comprehensive list of miRNA sequences willbe instrumental for bioinformatic approaches that make use of completedgenomes and the power of phylogenetic comparison in order to identifymiRNA-regulated target mRNAs.

REFERENCES AND NOTES

-   1. R. C. Lee, R. L. Feinbaum, V. Ambros, Cell 75, 843 (1993).-   2. B. J. Reinhart et al., Nature 403, 901 (2000).-   3. V. Ambros, Curr. Opin. Genet. Dev. 10, 428 (2000).-   4. E. G. Moss, Curr. Biol. 10, R436 (2000).-   5. F. Slack, G. Ruvkun, Annu. Rev. Genet. 31, 611 (1997).-   6. A. E. Pasquinelli et al., Nature 408, 86 (2000).-   7. S. M. Elbashir et al., Nature 411, 494 (2001).-   8. S. M. Elbashir, W. Lendeckel, T. Tuschl, Genes & Dev. 15, 188    (2001).-   9. A. J. Hamilton, D. C. Baulcombe, Science 286, 950 (1999).-   10. S. M. Hammond, E. Bernstein, D. Beach, G. J. Hannon, Nature 404,    293 (2000).-   11. P. D. Zamore, T. Tuschl, P. A. Sharp, D. P. Bartel, Cell 101, 25    (2000).-   12. G. Hutvágner, J. McLachlan, É. Bálint, T. Tuschl, P. D. Zamore,    Science 93, 834 (2001).-   13. A. Grishok at al., Cell 106, 23 (2001).-   14. Cloning of 19- to 24-nt RNAs from D. melanogaster 0-2 h embryo    lysate was performed as described (8). For cloning of HeLa miRNAs, 1    mg of HeLa total RNA was separated on a 15% denaturing    polyacrylamide gel and RNA of 19- to 25-nt size was recovered. A 5′    phosphorylated 3′ adapter oligonucleotide (5′ pUUU-aaccgcgaattccagx:    uppercase, RNA; lowercase, DNA; p, phosphate; x,    4-hydroxymethylbenzyl; SEQ ID NO:54) and a 5′ adapter    oligonucleotide (5′ acggaattcctcactAAA: uppercase, RNA; lowercase,    DNA; SEQ ID NO:55) were ligated to the short HeLa cell RNAs. RT/PCR    was performed with 3′ primer (5′ GACTAGCTGGAATTCGCGGTTAAA; SEQ ID    NO:56) and 5′ primer (5′ CAGCCAACGGAATTCCTCACTAAA; SEQ ID NO:57),    and followed by concatamerization after Eco RI digestion and T4 DNA    ligation (8). After ligation of concatamers into pCR2.1 TOPO    vectors, about 100 clones were selected and subjected to sequencing.-   15. I. Schneider, J Embryol Exp Morphol 27, 353 (1972).-   16. R. Feinbaum, V. Ambros, Dev. Biol. 210, 87 (1999).-   17. S. M. Hammond, S. Boettcher, A. A. Caudy, R. Kobayashi, G. J.    Hannon, Science 293, 1146 (2001).-   18. A. A. Aravin et al., Curr. Biol. 11, 1017 (2001).-   19. H. Tabara et al., Cell 99, 123 (1999).-   20. M. Fagard, S. Boutet, J. B. Morel, C. Bellini, H. Vaucheret,    Proc. Natl. Acad. Sci. USA 97, 11650 (2000).-   21. C. Catalanotto, G. Azzalin, G. Macino, C. Cogoni, Nature 404,    245 (2000).    22. S. R. Eddy, Curr. Opin. Genet. Dev. 9, 695 (1999).-   23. J. Cavaille et al., Proc. Natl. Acad. Sci. USA 97, 14311 (2000).-   24. A. Hüttenhofer at al., EMBO J. 20, 2943 (2001).-   25. L. Argaman et al., Curr. Biol. 11, 941 (2001).-   26. K. M. Wassarman, F. Repoila, C. Rosenow, G. Storz, S. Gottesman,    Genes & Dev. 15, 1637 (2001).-   27. Supplementary Web material is available on Science Online at    www.sciencemag.org/cgi/content/full/xxx-   28. D. H. Mathews, J. Sabina, M. Zuker, D. H. Turner, J. Mol. Biol.    288, 911 (1999).-   29. E. Bernstein, A. A. Caudy, S. M. Hammond, G. J. Hannon, Nature    409, 363 (2001).-   30. Graham, F. L. and van der Eb, A. J., (1973), Virol. 52, 456.-   31. McCutchan, J. H. and Pagano, J. S., (1968), J. Natl. Cancer    Inst. 41, 351.-   32. Chu, G. et al., (1987), Nucl. Acids Res. 15, 1311.-   33. Fraley, R. et al., (1980), J. Biol. Chem. 255, 10431.-   34. Capecchi, M. R., (1980), Cell 22, 479.-   35. Feigner, P. L. et al., (1987), Proc. Natl. Acad. Sci. USA 84,    7413.-   36. Lau N. C., Lim L. P., Weinstein E. G., Bartel D. P., (2001),    Science 294, 858-862.-   37. Lee R. C., Ambros V., (2001), Science 294, 862-864.-   38. Ambros V., (2001), Cell 107, 823-826.-   39. Ambros V., Horvitz H. R., (1984), Science 226, 409-416.-   40. Wightman B., Ha I., Ruvkun G., (1993), Cell 75, 855-862.-   41. Rougvie A. E., (2001), Nat. Rev. Genet. 2, 690-701.-   42. Ketting R. F., Fischer S. E., Bernstein E., Sijen T., Hannon G.    J., Plasterk R. H., (2001), Genes & Dev. 15, 2654-2659.-   43. Hallam S. J., Jin Y., (1998), Nature 395, 78-82.-   44. Gauwerky C. E., Huebner K., Isobe M., Nowell P. C., Croce C. M.,    (1989), Proc. Natl. Acad. Sci. USA 86, 8867-8871.-   45. P. Chomczynski, N. Sacchi, Anal Biochem 162, 156, (1987).-   46. Mourelatos Z., Dostie J., Paushkin S., Sharma A., Charroux B.,    Abel L., J. R., Mann M., Dreyfuss G., (2002), Genes & Dev., in    press.-   47. Tam W., (2001), Gene 274, 157-167.

TABLE 1 D. melanogaster miRNAs. The sequences given represent the mostabundant, and typically longest miRNA sequence identified by cloning;miRNAs frequently vary in length by one or two nucleotides at their 3′termini. From 222 short RNAs sequenced, 69 (31%) corresponded to miRNAs,103 (46%) to already characterized functional RNAs (rRNA, 7SL RNA,tRNAs), 30 (14%) to transposon RNA fragments, and 20 (10%) sequenceswith no database entry. The frequency (freq.) for cloning a particularmiRNA relative to all identified miRNAs is indicated in percent. Resultsof Northern blotting of total RNA isolated from staged populations of D.melanogaster are summarized. E, embryo; L, larval stage; P, pupae; A,adult; S2, Schneider-2 cells. The strength of the signal within eachblot is represented from strongest (+++) to undetected (−). let-7 stRNAwas probed as control. Genbank accession numbers and homologs of miRNAsidentified by database searching in other species are provided assupplementary material. freq. E E L1 + miRNA sequence (5′ to 3′) (%) 0-3h 0-6 h L2 L3 P A S2 miR-1 UGGAAUGUAAAGAAGUAUGGAG 32 + + +++ +++ ++ +++− (SEQ ID NO: 58) miR-2a* UAUCACAGCCAGCUUUGAUGAGC 3 (SEQ ID NO: 59)miR-2b* UAUCACAGCCAGCUUUGAGGAGC 3 ++ ++ ++ +++ ++ + +++ (SEQ ID NO: 60)miR-3 UCACUGGGCAAAGUGUGUCUCA# 9 +++ +++ − − − − − miR-4AUAAAGCUAGACAACCAUUGA 6 +++ +++ − − − − − (SEQ ID NO: 62) miR-5AAAGGAACGAUCGUUGUGAUAUG 1 +++ +++ +/− +/− − − − (SEQ ID NO: 63) miR-6UAUCACAGUGGCUGUUCUUUUU 13 +++ ++ +/− +/− − − − (SEQ ID NO: 64) miR-7UGGAAGACUAGUGAUUUUUGUUGU 4 +++ ++ +/− +/− +/− +/− +/− (SEQ ID NO: 65)miR-8 UAAUACUGUCAGGUAAAGAUGUC 3 +/− +/− +++ +++ + +++ − (SEQ ID NO: 66)miR-9 UCUUUGGUUAUCUAGCUGUAUGA 7 +++ ++ +++ +++ +++ +/− − (SEQ ID NO: 67)miR-10 ACCCUGUAGAUCCGAAUUUGU 1 + + ++ +++ +/− + − (SEQ ID NO: 68) miR-11CAUCACAGUCUGAGUUCUUGC 7 +++ +++ +++ +++ +++ + − (SEQ ID NO: 69) miR-12UGAGUAUUACAUCAGGUACUGGU 7 + + ++ ++ + +++ +/− (SEQ ID NO: 70) miR-13a*UAUCACAGCCAUUUUGACGAGU 1 +++ +++ +++ +++ + +++ +++ (SEQ ID NO: 71)miR-13b* UAUCACAGCCAUUUUGAUGAGU 0 (SEQ ID NO: 72) miR-14UCAGUCUUUUUCUCUCUCCUA 1 − − − − − − − (SEQ ID NO: 73) let-7UGAGGUAGUAGGUUGUAUAGUU 0 − − − − +++ +++ − (SEQ ID NO: 74) #= (SEQ IDNO: 61) *Similar miRNA sequences are difficult to distinguish byNorthern blotting because of potential cross-hybridization of probes.

TABLE 2 Human miRNAs. From 220 short RNAs sequenced, 100 (45%)corresponded to miRNAs, 53 (24%) to already characterized functionalRNAs (rRNA, snRNAs, tRNAs), and 67 (30%) sequences with no databaseentry. Results of Northern blotting of total RNA isolated from differentvertebrate species and S2 cells are indicated. For legend, see Table 1.freq. HeLa mouse adult frog miRNA sequence (5′ to 3′) (%) cells kidneyfish ovary S2 let-7a* UGAGGUAGUAGGUUGUAUAGUU# 10 +++ +++ +++ − − let-7b*UGAGGUAGUAGGUUGUGUGGUU 13 (SEQ ID NO: 76) let-7c* UGAGGUAGUAGGUUGUAUGGUU3 (SEQ ID NO: 77) let-7d* AGAGGUAGUAGGUUGCAUAGU 2 +++ +++ +++ − − (SEQID NO: 78) let-7e* UGAGGUAGGAGGUUGUAUAGU 2 +++ +++ +++ − − (SEQ ID NO:79) let-7f* UGAGGUAGUAGAUUGUAUAGUU 1 (SEQ ID NO: 80) miR-15UAGCAGCACAUAAUGGUUUGUG 3 +++ ++ + +/− − (SEQ ID NO: 81) miR-16UAGCAGCACGUAAAUAUUGGCG 10 +++ + +/− +/− − (SEQ ID NO: 82) miR-17ACUGCAGUGAAGGCACUUGU 1 +++ − − − − (SEQ ID NO: 83) miR-18UAAGGUGCAUCUAGUGCAGAUA 2 +++ − − − − (SEQ ID NO: 84) miR-19a*UGUGCAAAUCUAUGCAAAACUGA 1 +++ − +/− − − (SEQ ID NO: 85) miR-19b*UGUGCAAAUCCAUGCAAAACUGA 3 (SEQ ID NO: 86) miR-20 UAAAGUGCUUAUAGUGCAGGUA4 +++ − + − − (SEQ ID NO: 87) miR-21 UAGCUUAUCAGACUGAUGUUGA 10 +++ + ++− − (SEQ ID NO: 88) miR-22 AAGCUGCCAGUUGAAGAACUGU 10 +++ +++ + +/− −(SEQ ID NO: 89) miR-23 AUCACAUUGCCAGGGAUUUCC 2 +++ +++ +++ + − (SEQ IDNO: 90) miR-24 UGGCUCAGUUCAGCAGGAACAG 4 ++ +++ ++ − − (SEQ ID NO: 91)miR-25 CAUUGCACUUGUCUCGGUCUGA 3 +++ + ++ − − (SEQ ID NO: 92) miR-26a*UUCAAGUAAUCCAGGAUAGGCU 2 + ++ +++ − − (SEQ ID NO: 93) miR-26b*UUCAAGUAAUUCAGGAUAGGUU 1 − (SEQ ID NO: 94) miR-27 UUCACAGUGGCUAAGUUCCGCU2 +++ +++ ++ − − (SEQ ID NO: 95) miR-28 AAGGAGCUCACAGUCUAUUGAG 2 +++ +++− − − (SEQ ID NO: 96) miR-29 CUAGCACCAUCUGAAAUCGGUU 2 + +++ +/− − − (SEQID NO: 97) miR-30 CUUUCAGUCGGAUGUUUGCAGC 2 +++ +++ +++ − − (SEQ ID NO:98) miR-31 GGCAAGAUGCUGGCAUAGCUG 2 +++ − − − − (SEQ ID NO: 99) miR-32UAUUGCACAUUACUAAGUUGC 1 − − − − − (SEQ ID NO: 100) miR-33GUGCAUUGUAGUUGCAUUG 1 − − − − − (SEQ ID NO: 101) miR-1UGGAAUGUAAAGAAGUAUGGAG 0 − − + − − (SEQ ID NO: 102) miR-7UGGAAGACUAGUGAUUUUGUUGU 0 + − +/− − +/− (SEQ ID NO: 103) miR-9UCUUUGGUUAUCUAGCUGUAUGA 0 − − − − − (SEQ ID NO: 104) miR-10ACCCUGUAGAUCCGAAUUUGU 0 − + − − − (SEQ ID NO: 105) #= (SEQ ID NO: 75)*Similar miRNA sequences are difficult to distinguish by Northernblotting because of potential cross-hybridization of probes.

TABLE 3 Mouse miRNAs. The sequences indicated represent the longestmiRNA sequences identified by cloning. The 3′-terminus of miRNAs isoften truncated by one or two nucleotides. miRNAs that are more than 85%identical in sequence (i.e. share 18 out of 21 nucleotides) or contain1- or 2-nucleotide internal deletions are referred to by the same genenumber followed by a lowercase letter. Minor sequence variations betweenrelated miRNAs are generally found near the ends of the miRNA sequenceand are thought to not compromise target RNA recognition. Minor sequencevariations may also represent A to G and C to U changes, which areaccommodated as G-U wobble base pairs during target recognition. miRNAswith the suffix -s or -as indicate RNAs derived from either the 5′-halfor the 3′-half of a miRNA precursor. Mouse brains were dissected intomidbrain, mb, cortex, cx, cerebellum, cb. The tissues analyzed wereheart, ht; liver, lv; small intestine, si; colon, co; cortex, ct;cerebellum, cb; midbrain, mb. Number of clones miRNA sequence (5′ to 3′)ht lv sp si co cx cb mb let-7a UGAGGUAGUAGGUUGUAUAGUU 3 1 1 7 (SEQ IDNO: 106) let-7b UGAGGUAGUAGGUUGUGUGGUU 1 1 2 5 (SEQ ID NO: 107) let-7cUGAGGUAGUAGGUUGUAUGGUU 2 2 5 19 (SEQ ID NO: 108) let-7dAGAGGUAGUAGGUUGCAUAGU 2 2 2 2 (SEQ ID NO: 109) let-7eUGAGGUAGGAGGUUGUAUAGU 1 2 (SEQ ID NO: 110) let-7f UGAGGUAGUAGAUUGUAUAGUU2 3 3 (SEQ ID NO: 111) let-7g UGAGGUAGUAGUUUGUACAGUA 1 1 2 (SEQ ID NO:112) let-7h UGAGGUAGUAGUGUGUACAGUU 1 1 (SEQ ID NO: 113) let-7iUGAGGUAGUAGUUUGUGCU 1 1 (SEQ ID NO: 114) miR-1b UGGAAUGUAAAGAAGUAUGUAA 42 1 (SEQ ID NO: 115) miR-1c UGGAAUGUAAAGAAGUAUGUAC 7 (SEQ ID NO: 116)miR-1d UGGAAUGUAAAGAAGUAUGUAUU 16 1 (SEQ ID NO: 117) miR-9UCUUUGGUUAUCUAGCUGUAUGA 3 4 4 (SEQ ID NO: 118) miR-15aUAGCAGCACAUAAUGGUUUGUG 1 2 (SEQ ID NO: 119) miR-15bUAGCAGCACAUCAUGGUUUACA 1 (SEQ ID NO: 120) miR-16 UAGCAGCACGUAAAUAUUGGCG1 1 2 1 2 3 (SEQ ID NO: 121) miR-18 UAAGGUGCAUCUAGUGCAGAUA 1 (SEQ ID NO:122) miR-19b UGUGCAAAUCCAUGCAAAACUGA 1 (SEQ ID NO: 123) miR-20UAAAGUGCUUAUAGUGCAGGUAG 1 (SEQ ID NO: 124) miR-21 UAGCUUAUCAGACUGAUGUUGA1 1 2 1 (SEQ ID NO: 125) miR-22 AAGCUGCCAGUUGAAGAACUGU 2 1 1 1 2 (SEQ IDNO: 126) miR-23a AUCACAUUGCCAGGGAUUUCC 1 (SEQ ID NO: 127) miR-23bAUCACAUUGCCAGGGAUUACCAC 1 (SEQ ID NO: 128) miR-24 UGGCUCAGUUCAGCAGGAACAG1 1 1 1 (SEQ ID NO: 129) miR-26a UUCAAGUAAUCCAGGAUAGGCU 3 2 (SEQ ID NO:130) miR-26b UUCAAGUAAUUCAGGAUAGGUU 2 4 1 (SEQ ID NO: 131) miR-27aUUCACAGUGGCUAAGUUCCGCU 1 2 1 1 2 1 (SEQ ID NO: 132) miR-27bUUCACAGUGGCUAAGUUCUG 1 (SEQ ID NO: 133) miR-29a CUAGCACCAUCUGAAAUCGGUU 11 1 (SEQ ID NO: 134) miR-29b/miR-102 UAGCACCAUUUGAAAUCAGUGUU 1 1 5 3(SEQ ID NO: 135) miR-29c/ UAGCACCAUUUGAAAUCGGUUA 1 3 1 (SEQ ID NO: 136)miR-30a-s/miR-97 UGUAAACAUCCUCGACUGGAAGC 1 1 1 (SEQ ID NO: 137)miR-30a-as^(a) CUUUCAGUCGGAUGUUUGCAGC 1 (SEQ ID NO: 138) miR-30bUGUAAACAUCCUACACUCAGC 1 2 (SEQ ID NO: 139) miR-30cUGUAAACAUCCUACACUCUCAGC 2 1 1 (SEQ ID NO: 140) miR-30dUGUAAACAUCCCCGACUGGAAG 1 (SEQ ID NO: 141) miR-99a/miR-99ACCCGUAGAUCCGAUCUUGU 1 (SEQ ID NO: 142) miR-99b CACCCGUAGAACCGACCUUGCG 1(SEQ ID NO: 143) miR-101 UACAGUACUGUGAUAACUGA 2 1 1 (SEQ ID NO: 144)miR-122a UGGAGUGUGACAAUGGUGUUUGU 3 (SEQ ID NO: 145) miR-122bUGGAGUGUGACAAUGGUGUUUGA 11 (SEQ ID NO: 146) miR-122a,bUGGAGUGUGACAAUGGUGUUUG 23 (SEQ ID NO: 147) miR-123 CAUUAUUACUUUUGGUACGCG1 2 (SEQ ID NO: 148) miR-124a^(b) UUAAGGCACGCGG-UGAAUGCCA 1 37 41 24(SEQ ID NO: 149) miR-124b UUAAGGCACGCGGGUGAAUGC 1 3 (SEQ ID NO: 150)miR-125a UCCCUGAGACCCUUUAACCUGUG 1 1 (SEQ ID NO: 151) miR-125bUCCCUGAGACCCU--AACUUGUGA 1 (SEQ ID NO: 152) miR-126UCGUACCGUGAGUAAUAAUGC 4 1 (SEQ ID NO: 153) miR-127UCGGAUCCGUCUGAGCUUGGCU 1 (SEQ ID NO: 154) miR-128 UCACAGUGAACCGGUCUCUUUU2 2 2 (SEQ ID NO: 155) miR-129 CUUUUUUCGGUCUGGGCUUGC 1 (SEQ ID NO: 156)miR-130 CAGUGCAAUGUUAAAAGGGC 1 (SEQ ID NO: 157) miR-131UAAAGCUAGAUAACCGAAAGU 1 1 1 (SEQ ID NO: 158) miR-132UAACAGUCUACAGCCAUGGUCGU 1 (SEQ ID NO: 159) miR-133UUGGUCCCCUUCAACCAGCUGU 4 1 (SEQ ID NO: 160) miR-134UGUGACUGGUUGACCAGAGGGA 1 (SEQ ID NO: 151) miR-135UAUGGCUUUUUAUUCCUAUGUGAA 1 (SEQ ID NO: 162) miR-136ACUCCAUUUGUUUUGAUGAUGGA 1 (SEQ ID NO: 163) miR-137UAUUGCUUAAGAAUACGCGUAG 1 1 (SEQ ID NO: 164) miR-138 AGCUGGUGUUGUGAAUC 1(SEQ ID NO: 165) miR-139 UCUACAGUGCACGUGUCU 1 1 (SEQ ID NO: 166) miR-140AGUGGUUUUACCCUAUGGUAG 1 (SEQ ID NO: 167) miR-141 AACACUGUCUGGUAAAGAUGG 11 1 (SEQ ID NO: 168) miR-142-s CAUAAAGUAGAAAGCACUAC 1 1 (SEQ ID NO: 169)miR-142-as^(b) UGUAGUGUUUCCUACUUUAUGG 1 1 6 (SEQ ID NO: 170) miR-143UGAGAUGAAGCACUGUAGCUCA 3 7 2 1 (SEQ ID NO: 171) miR-144UACAGUAUAGAUGAUGUACUAG 2 1 (SEQ ID NO: 172) miR-145GUCCAGUUUUCCCAGGAAUCCCUU 1 (SEQ ID NO: 173) miR-146UGAGAACUGAAUUCCAUGGGUUU 1 (SEQ ID NO: 174) miR-147 GUGUGUGGAAAUGCUUCUGCC1 (SEQ ID NO: 175) miR-148 UCAGUGCACUACAGAACUUUGU 1 (SEQ ID NO: 176)miR-149 UCUGGCUCCGUGUCUUCACUCC 1 (SEQ ID NO: 177) miR-150UCUCCCAACCCUUGUACCAGUGU 1 (SEQ ID NO: 178) miR-151CUAGACUGAGGCUCCUUGAGGU 1 (SEQ ID NO: 179) miR-152 UCAGUGCAUGACAGAACUUGG1 (SEQ ID NO: 180) miR-153 UUGCAUAGUCACAAAAGUGA 1 (SEQ ID NO: 181)miR-154 UAGGUUAUCCGUGUUGCCUCG 1 (SEQ ID NO: 182) miR-155UUAAUGCUAAUUGUGAUAGGGG 1 (SEQ ID NO: 183) ^(a)The originally describedmiR-30 was renamed to miR-30a-as in order to distinguish it from themiRNA derived from the opposite strand of the precursor encoded by themir-30a gene. miR-30a-s is equivalent to miR-97 [46]. ^(b)A 1-nt lengthheterogeneity is found on both 5′ and 3′ end. The 22-nt miR sequence isshown, but only 21-nt miRNAs were cloned.

TABLE 4 Mouse and human miRNAs. The sequences indicated represent thelongest miRNA sequences identified by cloning. The 3′ terminus of miRNAsis often truncated by one or two nucleotides. miRNAs that are more than85% identical in sequence (i.e. share 18 out of 21 nucleotides) orcontain 1- or 2-nucleotide internal deletions are referred to by thesame gene number followed by a lowercase letter. Minor sequencevariations between related miRNAs are generally found near the ends ofthe miRNA sequence and are thought to not compromise target RNArecognition. Minor sequence variations may also represent A to G and Cto U changes; which are accommodated as G-U wobble base pairs duringtarget recognition. Mouse brains were dissected into midbrain, mb,cortex, cx, cerebellum, cb. The tissues analyzed were lung, ln; liver,lv; spleen, sp; kidney, kd; skin, sk; testis, ts; ovary, ov; thymus,thy; eye, ey; cortex, ct; cerebellum, cb; midbrain, mb. The humanosteosarcoma cells SAOS-2 cells contained an inducible p53 gene (p53−,uninduced p53; p53+, induced p53); the differences in miRNAs identifiedfrom induced and uninduced SAOS cells were not statisticallysignificant. number of clones human SAOS- mouse tissues 2 cells miRNASequence (5′ to 3′) ln lv sp kd sk ts ov thy ey p53− p53+ miR-C1AACAUUCAACGCUGUCGGUGAGU 1 1 2 (SEQ ID NO. 184) miR-C2UUUGGCAAUGGUAGAACUCACA 1 (SEQ ID NO. 185) miR-C3 UAUGGCACUGGUAGAAUUCACUG1 (SEQ ID NO. 186) miR-C4 CUUUUUGCGGUCUGGGCUUGUU 1 1 1 (SEQ ID NO. 187)miR-C5 UGGACGGAGAACUGAUAAGGGU 2 (SEQ ID NO. 188) miR-C6UGGAGAGAAAGGCAGUUC 1 (SEQ ID NO. 189) miR-C7 CAAAGAAUUCUCCUUUUGGGCUU 1 1(SEQ ID NO. 190) miR-C8 UCGUGUCUUGUGUUGCAGCCGG 1 (SEQ ID NO. 191) miR-C9UAACACUGUCUGGUAACGAUG 1 (SEQ ID NO. 192) miR-C10 CAUCCCUUGCAUGGUGGAGGGU1 (SEQ ID NO. 193) miR-C11 GUGCCUACUGAGCUGACAUCAGU 1 (SEQ ID NO. 194)miR-C12 UGAUAUGUUUGAUAUAUUAGGU 2 (SEQ ID NO. 195) miR-C13CAACGGAAUCCCAAAAGCAGCU 2 1 (SEQ ID NO. 196) miR-C14 CUGACCUAUGAAUUGACA 21 (SEQ ID NO. 197) miR-C15 UACCACAGGGUAGAACCACGGA 1 (SEQ ID NO. 198)miR-C16 AACUGGCCUACAAAGUCCCAG 1 (SEQ ID NO. 199) miR-C17UGUAACAGCAACUCCAUGUGGA 1 (SEQ ID NO. 200) miR-C18 UAGCAGCACAGAAAUAUUGGC2 1 1 (SEQ ID NO. 201) miR-C19 UAGGUAGUUUCAUGUUGUUGG 1 (SEQ ID NO. 202)miR-C20 UUCACCACCUUCUCCACCCAGC 1 1 (SEQ ID NO. 203) miR-C21GGUCCAGAGGGGAGAUAGG 1 (SEQ ID NO. 204) miR-C22 CCCAGUGUUCAGACUACCUGUU 1(SEQ ID NO. 205) miR-C23 UAAUACUGCCUGGUAAUGAUGAC 2 1 (SEQ ID NO. 206)miR-C24 UACUCAGUAAGGCAUUGUUCU 1 (SEQ ID NO. 207) miR-C25AGAGGUAUAGCGCAUGGGAAGA 1 (SEQ ID NO. 208) miR-C26 UGAAUGUUUAGGACCACUAG 1(SEQ ID NO. 209) miR-C27 UUCCCUUUGUCAUCCUAUGCCUG 1 (SEQ ID NO. 210)miR-C28 UCCUUCAUUCCACCGGAGUCUG 1 (SEQ ID NO. 211) miR-C29GUGAAAUGUUUAGGACCACUAGA 2 (SEQ ID NO. 212) miR-C30UGGAAUGUAAGGAAGUGUGUGG 2 (SEQ ID NO. 213) miR-C31 UACAGUAGUCUGCACAUUGGUU1 (SEQ ID NO. 214) miR-C32 CCCUGUAGAACCGAAUUUGUGU 1 1 (SEQ ID NO. 215)miR-C33 AACCCGUAGAUCCGAACUUGUGAA 1 (SEQ ID NO. 216) miR-C34GCUUCUCCUGGCUCUCCUCCCUC 1 (SEQ ID NO. 217)

TABLE 5 D. melanogaster miRNA sequences and genomic location. Thesequences given represent the most abundant, and typically longest miRNAsequences identified by cloning. It was frequently observed that miRNAsvary in length by one or two nucleotides at their 3′-terminus. From 222short RNAs sequenced; 69 (31%) corresponded to miRNAs, 103 (46%) toalready characterized functional RNAs (rRNA, 7SL RNA, tRNAs), 30 (14%)to transposon RNA fragments, and 20 (10%) sequences with no databaseentry. RNA sequences with a 5′- guanosine are likely to beunderrepresented due to the cloning procedure (8). miRNA homologs foundin other species are indicated. Chromosomal location (chr.) and GenBankaccession numbers (acc. nb.) are indicated. No ESTs matching miR-1 tomiR-14 were detectable by database searching. miRNA sequence (5′ to 3′)chr., acc. nb. remarks miR-1 UGGAAUGUAAAGAAGUAUGGAG 2L, AE003667homologs: C. briggsae, G20U, (SEQ ID NO: 58) AC87074; C. elegans G20U,U97405; mouse, G20U, G22U, AC020867; human, chr. 20, G20U, G22U,AL449263; ESTs: zebrafish, G20U, G22U, BF157- 601; cow, G20U, G22U,BE722- 224; human, G20U, G22U, AI220268 miR-2a UAUCACAGCCAGCUUUGAUGAGC2L, AE003663 2 precursor variants clustered (SEQ ID NO: 59) with a copyof mir-2b miR-2b UAUCACAGCCAGCUUUGAGGAGC 2L, AE003620 2 precursorvariants (SEQ ID NO: 60) 2L, AE003663 miR-3 UCACUGGGCAAAGUGUGUCUCA 2R,AE003795 in cluster mir-3 to mir-6 (SEQ ID NO: 61) miR-4AUAAAGCUAGACAACCAUUGA 2R, AE003795 in cluster mir-3 to mir-6 (SEQ ID NO:62) miR-5 AAAGGAACGAUCGUUGUGAUAUG 2R, AE003795 in cluster mir-3 to mir-6(SEQ ID NO: 63) miR-6 UAUCACAGUGGCUGUUCUUUUU 2R, AE003795 in clustermir-3 to mir-6 with 3 (SEQ ID NO: 64) variants miR-7UGGAAGACUAGUGAUUUUGUUGU 2R, AE003791 homologs: human, chr. 19 (SEQ IDNO: 65) AC006537, EST BF373391; mouse chr. 17 AC026385, EST AA881786miR-8 UAAUACUGUCAGGUAAAGAUGUC 2R, AE003805 (SEQ ID NO: 66) miR-9UCUUUGGUUAUCUAGCUGUAUGA 3L, AE003516 homologs: mouse, chr. 19, (SEQ IDNO: 67) AF155142; human, chr. 5, AC026701, chr. 15, AC005316 miR-10ACCCUGUAGAUCCGAAUUUGU AE001574 homologs: mouse, chr 11, (SEQ ID NO: 68)AC011194; human, chr. 17, AF287967 miR-11 CAUCACAGUCUGAGUUCUUGC 3R,AE003735 intronic location (SEQ ID NO: 69) miR-12UGAGUAUUACAUCAGGUACUGGU X, AE003499 intronic location (SEQ ID NO: 70)miR-13a UAUCACAGCCAUUUUGACGAGU 3R, AE003708 mir-13a clustered withmir-13b (SEQ ID NO: 71) X, AE003446 on chr. 3R miR-13bUAUCACAGCCAUUUUGAUGAGU 3R, AE003708 mir-13a clustered with mir-13b (SEQID NO: 72) on chr. 3R miR-14 UCAGUCUUUUUCUCUCUCCUA 2R, AE003833 nosignal by Northern analysis (SEQ ID NO: 73)

TABLE 6 Human miRNA sequences and genomic location. From 220 short RNAssequenced, 100 (45%) corresponded to miRNAs, 53 (24%) to alreadycharacterized functional RNAs (rRNA, snRNAs, tRNAs), and 67 (30%)sequences with no database entry. For legend, see Table 1. chr. or EST,miRNA sequence (5′ to 3′) acc. nb. remarks* let-7aUGAGGUAGUAGGUUGUAUAGUU 9, AC007924, sequences of chr 9 and 17 (SEQ IDNO: 75) 11, AP001359, identical and clustered with let-7f, 17, AC087784,homologs: C. elegans, AF274345; 22, AL049853 C. briggsae, AF210771, D.melanogaster, AE003659 let-7b UGAGGUAGUAGGUUGUGUGGUU 22, AL049853\,homologs: mouse, EST AI481799; (SEQ ID NO: 76) ESTs, AI382133, rat, EST,BE120662 AW028822 let-7c UGAGGUAGUAGGUUGUAUGGUU 21, AP001667 Homologs:mouse, EST, (SEQ ID NO: 77) AA575575 let-7d AGAGGUAGUAGGUUGCAUAGU 17,AC087784, identical precursor sequences (SEQ ID NO: 78) 9, AC007924let-7e UGAGGUAGGAGGUUGUAUAGU 19, AC018755 (SEQ ID NO: 79) let-7fUGAGGUAGUAGAUUGUAUAGUU 9, AC007924, sequences of chr 9 and 17 (SEQ IDNO: 80) 17, AC087784, identical and clustered with let-7a X, AL592046miR-15 UAGCAGCACAUAAUGGUUUGUG 13, AC069475 in cluster with mir-16homolog (SEQ ID NO: 81) miR-16 UAGCAGCACGUAAAUAUUGGCG 13, AC069475 incluster with mir-15 homolog (SEQ ID NO: 82) miR-17 ACUGCAGUGAAGGCACUUGU13, AL138714 in cluster with mir-17 to mir-20 (SEQ ID NO: 83) miR-18UAAGGUGCAUCUAGUGCAGAUA 13, AL138714 in cluster with mir-17 to mir-20(SEQ ID NO: 84) miR-19a UGUGCAAAUCUAUGCAAAACUG 13, AL138714 in clusterwith mir-17 to mir-20 A (SEQ ID NO: 85) miR-19b UGUGCAAAUCCAUGCAAAACUG13, AL138714, in cluster with mir-17 to mir-20 A (SEQ ID NO: 86) X,AC002407 miR-20 UAAAGUGCUUAUAGUGCAGGUA 13, AL138714 in cluster withmir-17 to mir-20 (SEQ ID NO: 87) miR-21 UAGCUUAUCAGACUGAUGUUGA 17,AC004686, homologs: mouse, EST, (SEQ ID NO: 88) EST, BF326048 AA209594miR-22 AAGCUGCCAGUUGAAGAACUGU ESTs, human ESTs highly similar; (SEQ IDNO: 89) AW961681\, homologs: mouse, ESTs, e.g. AA456477, AA823029; rat,ESTs, e.g. AI752503, BF543690 BF030303, HS1242049 miR-23AUCACAUUGCCAGGGAUUUCC 19, AC020916 homologs: mouse, EST, (SEQ ID NO: 90)AW124037; rat, EST, BF402515 miR-24 UGGCUCAGUUCAGCAGGAACAG 9, AF043896,homologs: mouse, ESTs, (SEQ ID NO: 91) 19, AC020916 AA111466, AI286629;pig, EST, BE030976 miR-25 CAUUGCACUUGUCUCGGUCUGA 7, AC073842, human chr7 and EST identical; (SEQ ID NO: 92) EST, BE077684 highly similarprecursors in mouse ESTs (e.g. AI595464); fish precursor different STS:G46757 miR-26a UUCAAGUAAUCCAGGAUAGGCU 3, AP000497 (SEQ ID NO: 93)miR-26b UUCAAGUAAUUCAGGAUAGGUU 2, AC021016 (SEQ ID NO: 94) miR-27UUCACAGUGGCUAAGUUCCGCU 19, AC20916 U22C mutation in human genomic (SEQID NO: 95) sequence miR-28 AAGGAGCUCACAGUCUAUUGAG 3, AC063932 (SEQ IDNO: 96) miR-29 CUAGCACCAUCUGAAAUCGGUU 7, AF017104 (SEQ ID NO: 97) miR-30CUUUCAGUCGGAUGUUUGCAGC 6, AL035467 (SEQ ID NO: 98) miR-31GGCAAGAUGCUGGCAUAGCUG 9, AL353732 (SEQ ID NO: 99) miR-32UAUUGCACAUUACUAAGUUGC 9, AL354797 not detected by Northern blotting (SEQID NO: 100) miR-33 GUGCAUUGUAGUUGCAUUG 22, Z99716 not detected byNorthern blotting (SEQ ID NO: 101) *If several ESTs were retrieved forone organism in the database, only those with different precursorsequences are listed. \precursor structure shown in FIG. 4.

1. Isolated nucleic acid molecule comprising (a) a nucleotide sequenceas shown in Table 1, Table 2, Table 3 or Table 4 or a precursor thereofas shown in FIG. 3, FIG. 4 or FIG.
 7. (b) a nucleotide sequence which isthe complement of (a). (c) a nucleotide sequence which has an identityof at least 80% to a sequence of (a) or (b) and/or (d) a nucleotidesequence which hybridizes under stringent conditions to a sequence of(a), (b) and/or (c).
 2. The nucleic acid molecule of claim 1, whereinthe identity of sequence (c) is at least 90%.
 3. The nucleic acidmolecule of claim 1, wherein the identity of sequence (c) is at least90%.
 4. The nucleic acid molecule of claim 1, which is selected from miR1-14 as shown in Table 1 or miR 15-33 as shown in Table 2 or miR 1-155as shown in Table 3 or miR-C1-34 as shown in Table 4 or a complementthereof.
 5. The nucleic acid molecule of claim 1, which is selected frommir 1-14 as shown in FIG. 3 or let 7a-7f or mir 15-33, as shown in FIG.4 or let 7a-I or mir 1-155 or mir-c1-34, as shown in FIG. 7 or acomplement thereof.
 6. The nucleic acid molecule of claim 1 which is amiRNA molecule or an analog thereof having a length of from 18-25nucleotides.
 7. The nucleic acid molecule of claim 1, which is a miRNAprecursor molecule having a length of 60-80 nucleotides or a DNAmolecule coding therefor.
 8. The nucleic acid molecule of claim 1, whichis single-stranded.
 9. The nucleic acid molecule of claim 1, which is atleast partially double-stranded.
 10. The nucleic acid molecule of claim1, which is selected from RNA, DNA or nucleic acid analog molecules. 11.The nucleic acid molecule of claim 10, which is a molecule containing atleast one modified nucleotide analog.
 12. The nucleic molecule of claim10 which is a recombinant expression vector.
 13. A pharmaceuticalcomposition containing as an active agent at least one nucleic acidmolecule of claim 1 and optionally a pharmaceutically acceptablecarrier.
 14. The composition of claim 13 for diagnostic applications.15. The composition of claim 13 for therapeutic applications.
 16. Thecomposition of claim 13 as a marker or a modulator for developmental orpathogenic processes.
 17. The composition of claims 13 as a marker ormodulator of developmental disorders, particularly cancer, such a B-cellchronic leukemia.
 18. The composition of claim 13 as a marker ormodulator of gene expression.
 19. The composition of claim 18 as amarker or modulator of the expression of a gene, which is at leastpartially complementary to said nucleic acid molecule.
 20. A method ofidentifying microRNA molecules or precursor molecules thereof comprisingligating 5′- and 3′-adapter molecules to the ends of a size-fractionatedRNA population, reverse transcribing said adapter-containing RNApopulation and characterizing the reverse transcription products.